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ABSTRACT 


A method of sequencing a target nucleic acid using primer 
extension mass spectroscopy to generate an observed mass 
spectrum, on which base calling cycles are carried out using 
calibrated inter-peak mass difference values such that: each 
called base allows mass calculation for the base-called peak 
in the observed mass spectrum; this calculated mass is then 
used as a further calibration point for subsequent rounds of 
base calling. A reaction mixture of all four base-specific 
chain extension nucleotides and four chain termination 
nucleotide analogues. A method of performing mass 
spectroscopy, which method comprises subjecting molecu- 
lar ions which have been chemically charged in a predeter- 
mined manner to time-of -flight or Fourier transform mass 
spectroscopy. 

17 Claims, 3 Drawing Sheets 
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Fig.3. 


CALCULATE THE REGRESSION COEFFICIENT AND 
ITERCEPT FROM THE KNOWN AND OBSERVED 
MASSES OF 2 REFERENCE SPECIES 
{EXAMPLE-PRIMER AND PRIMER+1 RESIDUE) 
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In another aspect the invention provides, as a reagent 
suitable for use in this method though also having other uses, 
a reaction mixture containing all four base specific chain 
extension nucleotides which are not labelled so as to be 

5 distinguishable from one another, and four chain termination 
nucleotide analogues which are not labelled so as to be 
distinguishable from one another. 

In another aspect the invention provides, as a technique 
suitable for use in this method though also having other uses, 

10 a method of performing mass spectroscopy, which method 
comprises subjecting molecular ions which have been 
chemically charged in a predetermined manner to time-of- 
flight or Fourier transform mass spectroscopy. This aspect is 
described below under the heading "Chemically-created 

15 Molecular Ions". 

BRIEF DESCRIPTION OF DRAWINGS 

Reference is directed to the accompanying drawings in 
which: 

20 

FIG. 1 is a Mass Spectrograph, 

FIG. 2 is a Base Calling Program, and 

FIG. 3 is a Flow Diagram for the Program of FIG. 2. 

25 PRIMER EXTENSION MASS SPECTROSCOPIC 
(PEMS) SEQUENCING 

PEMS sequencing according to this invention differs from 
Sanger sequencing and sequencing by hybridisation in that 

3Q it generates extension products from primers (with all four 
dN I Ps and all four ddNTPs together in one reaction) and 
thence allows the sequence to be directly read by the 
observed mass difference between sequential peaks in a 
mass spectrum. Many hybridisation events may be spatially 

35 arrayed, extended in parallel and then sequentially analysed 
by sequential desorption/ionisation followed by mass spec- 
troscopy. 

With the PEMS method, the amount of sequence data 
obtained from each initial hybridisation is greater than that 


PRIMER EXTENSION MASS 
SPECTROSCOPY NUCLEIC ACID 
SEQUENCING METHOD 

This application is a 371 of PCT/B94/02527 filed Nov. 
17,1994. 

INTRODUCTION 

Sanger Sequencing: 

Conventional (Sanger) nucleic acid sequencing [1] relies 
upon the hybridisation of an oligonucleotide primer to target 
nucleic acid. A polymerase is then used to extend this 
complex into four separate nested sets of extension products 
terminating in dideoxy base analogues. These nested sets are 
resolved by size using gel electrophoresis. Various labelling 
and detection strategies can be employed to read the 
sequence from the base-specific nested sets. Conventional 
nucleic acid sequencing is highly skilled, very laborious and 
the rate of data acquisition is limited by the time consuming 
gel electrophoresis step. 
Sequencing by Hybridisation: 

Schemes for nucleic acid sequencing by hybridisation 
(SBH) [2] rely upon the stringent hybridisation of many 
oligonucleotide probes (such as all the possible 8 mers) to 
target nucleic acid. Although elegant, such schemes are 
critically dependent on the ability to discriminate perfect 
matches from mismatches. Such discrimination is currently 
far from a reality. SBH is a compromise between the size of 
the experiment (65,536 8 mers need to be synthesised to 
make the complete set) and the quality of the data used to 
reconstruct the target sequence. The longer the sequence 
data obtained from each hybridisation, the easier the target 
sequence is to reconstruct but the larger the number of 
hybridisations that need to be carried out. 

This invention directly addresses a great many of the 
abovementioned problems. 
The Invention 

In one aspect the invention provides a method of sequenc- 
ing a target nucleic acid by using primer extension mass 

spectroscopy to generate an observed mass spectrum 4Q G f j us t the oligonucleotide used for hybridisation. The 


wherein the observed mass spectrum is generated by the 
steps of: 

a) Hybridising at least one oligonucleotide primer to the 
target nucleic acid so as to form at least one primer- 
target hybrid, 

b) Subjecting each primer-target hybrid to polymerase 
catalysed chain extension conditions in the presence of 
four chain-extending nucleotides and of four chain- 
terminating nucleotide analogues, so as to generate a 
nested set of primer extension products, 

c) Subjecting each nested set of primer extension products 
to minimal fragmentation mass spectroscopy so as to 
generate the observed mass spectrum, 

wherein the following values are known with certainty: 
CNT**-, CM - dc ; CM" rfG ", CM"'' 7 ", CM"**-, CM"^", 
CM ddG -, CM - dJT ~, and CMO (the mass of the primer 
used), 

the calculated value CMO is used in order to calibrate 
inter-peak mass difference values (OIPMD) of the 
observed mass spectrum, 

base calling cycles are carried out using the calibrated 
OIPMD values such that: 

each called base allows mass calculation for the 
base-called peak in the observed mass spectrum, 

this calculated mass is then used as a further cali- 
bration point for subsequent rounds of base call- 
ing. 


amount of sequence data obtained from each initial hybridi- 
sation is potentially of the order of 100 or more bases 
(dependent upon the resolution of the mass spectrometer 
used for sequence determination). Furthermore, the rate of 
45 sequence acquisition with mass spectroscopy is potentially 
vastly greater than that possible using gel electrophoresis. 

The PEMS method gives rise to an observed mass spec- 
trum for the extension fragments. The present invention also 
relates to a method for determining the order of the bases 
50 from a PEMS sequencing observed mass spectrum. 

Sequence Reconstruction from Multiple Arrayed Hybridiza- 
tions: 

A number of oligonucleotide primers could be 
synthesized, hybridized to the target and extended into 

55 nested sets in separate reactions. The oligonucleotides could 
be arranged in a spatial array and simultaneously hybridized 
to the target. Similarly, the primer extension to generate 
nested sets from each oligonucleotide primer hybridized to 
the target could be simultaneously carried out for the whole 

60 array. Mass spectroscopic detection would then be sequen- 
tial for each nested set generated in the array. 

Because the amount of sequence generated from each 
oligonucleotide to target hybridization event in the PEMS 
method is greater than that of just the oligonucleotide used 

65 for hybridization, fewer oligonucleotide to target hybridiza- 
tion events are required in order to reconstruct the target 
sequence than are required in SBH schemes. In addition, the 
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number of gaps in the sequence (due to branch points) will 
be considerably less with PEMS sequencing than with SBH 
schemes. 

One further advantage of the PEMS method is the ability 
to walk along the target for gap filling in the sequence. 
Oligonucleotide primers corresponding to the end of the 
readable part of one round of PEMS sequencing can be 
synthesized (or formed by stacking two or more smaller 
oligonucleotides) and used for a subsequent round of PEMS 
sequencing. 

TERMINOLOGY 

Oligonucleotides: 

Oligonucleotides may be DNA or RNA based. They may 
contain one or more modified residue (modification may be 
in the base, sugar or backbone moiety or combinations of the 
above). They may also contain a non-phosphodiester back- 
bone (phoshorothioate, methylphosphonate and nylon back- 
bones are just some examples of non-phosphodiester 
backbones). Oligonucleotides may also have attached inter- 
calating groups (such as acridine derivatives) in order to 
stabilize their binding to target molecules. 
Target Molecules: 


The Method: 

For each combination of oligonucleotide primer and 
target, a hybridization reaction is carried out. Primer exten- 
sion is then carried out using a polymerase. The nested set 
5 of extension products generated by this process is then 
analyzed by mass spectroscopy, as described below, to give 
the sequence of bases extending from the oligonucleotide 
primer. 

Spatial isolation between each oligonucleotide primer and 
10 target hybridization event allows sequential analyses by 
mass spectroscopy and thus sequential sequence determina- 
tion to be facilitated. 

Matrix Assisted Laser Desorption Ionization (MALDI) [41], 
Matrix Assisted Laser Desorption Ionization Time-of- Flight 
15 Mass SpectroscoTY (MALDI-TOF MS) and Matrix 
Assisted Laser Desorption Ionization Fourier Transform 
Mass Spectroscopy (MALDI -FT MS) [5]: 

MALDI-TOF MS and MALDI-FT MS are tools for the 
mass determination of high molecular weight species. 
20 With MALDI, a laser pulse is used to desorb and ionize 
the species of interest from a matrix in which it is embedded. 
The desorption and ionization are very mild and produce 
molecular ions with very few fragmentation products. The 
very low level of molecular ion fragmentation allows the 


Target molecules may be DNA or RNA based rendered direct readout of ^ nce or residues m thc mass 

iiporanly, partially or wholly single stranded for the zt> . „ M * u„u^a:^*:„~ 


temporarily, partially or wholly singJ 
purposes of hybridization. They may be genomic DNA, 
YACS, PI clones, cosmids, lambda, plasmids, M13 clones, 
phagemids, PCR products or chemically synthesized oligo- 
nucleotides. 
Hybridization: 

Oligonucleotides are hybridized to target nucleic acid 
molecules. Hybridization may be in free solution, with 
oligonucleotide bound to a solid support or with target 
bound to a solid support. 

Hybridization and/or extension may be carried out within 
the matrix used for MALDI. Alternatively, Hybridization 
and/or extension may be carried out elsewhere followed by 


spectrum of a nested set originating from the hybridization 
primer. Any other desorption and ionization method could 
potentially be used provided that similarly low levels of 
molecular ion fragmentation could be achieved. Thus, des- 
30 orption and ionization need not necessarily be limited to 
matrix assisted laser pulsing. 

In TOF MS, the desorbed molecular ions are accelerated 
in a constant electric field and are detected at the end of a 
linear tube. The time difference between laser pulse desorp- 
35 tion and detection is proportional to the square root of the 
mass/charge for each molecular ion. 

In FT MS, ions are trapped by a combination of magnetic 
and electostatic fields. The trapped ions are induced into 
coherent cyclotron motion by a variety of pulsed excitation 


transfer to the matrix used for MALDI. 

Hybridization can be to single oligonucleotides or to 40 techniques. The time domain signal produced after this 

multiples of stacked oligonucleotides [3] (these stacks may excitation is then Fourier Transformed to give a spectrum of 

or may not be stabilized by DNA binding proteins, for abundance against mass/charge for each molecular ion. 

example single strand binding proteins). Desorption, ionisation and molecular ion fragmentation 

Primer Extension: are very much dependent upon: the nature of the matrix, the 

The polymerase for primer extension may be either an 45 concentration of the matrix, the nature of the species of 


RNA- or a DNA- dependent RNA or DNA polymerase. It 
may be either thermophilic or mesophilic and may be used 
with Mg, Mn or Mg+Mn based buffer systems. 

Generation of a nested set of termination products at all 
four bases can be achieved by three possible methods: 

1. Extension of the primer followed by controlled exo- 
nuclease digestion from either the 5' or the 3' end only. 

2. Extension of the primer under conditions where the 
concentration of one or more of the four dNTPs (or, 
more generally, chain extending analogues) are limit- 
ing. 

3. Extension of the primer in the presence of all four 
dNTPs (or, more generally, chain extending analogues) 
and all four ddNTPs (or, more generally, chain termi- 
nating analogues). In this way a nested set of all 
extension fragments over about 100-200 bases can be 
obtained from the primer. 

dNTPs and/or ddNTPs used for primer extension may be 
unmodified or they may be modified (either chemically or by 
isotopic composition) in order to improve the IPMDs (see 
below) and/or the generation of the nested set from the 
primer. 


interest, the concentration of the species of interest, the 
nature of the buffer, the concentration of various buffer 
components, the co -crystallisation of the matrix and species 
of interest and the laser power used for the pulse. The 
50 MALDI technique also generally requires a high ratio of 
matrix to species of interest thus compromising sensitivity. 
Much work is generally required in order to optimise all of 
the above factors. 

In addition to the above, the laser pulse gives rise to a 
55 cloud of molecular ions that do not all have exactly the same 
initial spatial locations or initial kinetic energies. The non- 
identical initial spatial locations and nonidentical initial 
kinetic energies of the molecular ions combine to limit the 
available resolution of the subsequent mass spectroscopy. 
60 The Inter Peak Mass Difference (IPMD): 

Let the mass of a given extension product r residues long 
be M,. 

Let the mass of the extension product r-1 residues long be 
M r _l 

65 The Inter Peak Mass Difference (IPMD,.) for residue r is 
defined as: 

(IPMD r )=(M r )-(M-l) 
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Let the mass increase to the extension product by adding 
an A-specific chain terminating residue at the 3' end of the 
extension product be MT ddA . 

Let the mass increase to the extension product by adding 
an C-specific chain terminating residue at the 3' end of the 
extension product be M' ddc . 

Let the mass increase to the extension product by adding 
an G-specifie chain terminating residue at the 3' end of the 
extension product be M~ ddG . 

Let the mass increase to the extension product by adding 
an T-specific chain terminating residue at the 3' end of the 
extension product be M' ddT . 

Let the mass added to the extension product by adding a 
A-specific chain extending residue between the primer and 
the 3' end of a chain terminated extension product be M" tM ". 

Let the mass added to the extension product by adding a 
C-specific chain extending residue between the primer and 
the 3' end of a chain terminated extension product be YA~ dC ~ . 

Let the mass added to the extension product by adding a 
G-specific chain extending residue between the primer and 
the 3' end of a chain terminated extension product be M dG . 

Let the mass added to the extension product by adding a 
T-specific chain extending residue between the primer and 
the 3' end of a chain terminated extension product be M~ dr ~. 

EXAMPLE 


primer-dA-dC-dG-dT-dA-dT-dG— 
dT-dG-dC-dA-dT-dA-dC 

Extension Product: 
piimer-dA-dC-dG-dT-dA-dT- 
ddG 

primer-dA-dC-dG-dT-dA-ddT 
piimer-dA-dC-dG-dT-ddA 
primer-dA-dC-dG-ddT 
p lime r- dA- dC- ddG 
piimer-dA-ddC 
piimer-ddA 
primer 


primer (synthesized strand) 
target nucleic acid 
(template strand) 
(IPMDj - (Mj) - (M^) 
IPMD r _ 7 = (M ddG ) + (M dT ) - (M~ d<lr ) 

IPMD r _ e = (M- d<rr ) + (M _dA ~) - (M ddA ) 
IPMD r _ s = (M" ridA ) + (M~ dT ~) - (M^ 1 * 1 ) 
IPMD I=4 = (M" d<n ) + (M- d °-) - (M ddo ) 
IPMD r=3 = (M- ddo ) + (M" dc -) - (M ddc ) 
IPMD r _ 2 = (M ddc ) + (M^) - (M _ddA ) 
IPMD,,! - (M ddA > 


For the simplest case, where dNTP and ddNTPs residues 
made up from 12C, 14N, 160, 31P, 1H, which have not been 
modified in any way are used, we have: 
M-ddA=297.0627 
M-ddC=273.0515 
M-ddG=313.0576 
M-ddT=288.0511 
M-dA-=313.0576 
M-dC-=289.0464 
M-dG-=329.0525 

M-dT-=304.0460 (the backbone phosphate is considered to 

be un-ionised for the purposes of this example), 
and the above values for IPMDs simplify to: 


primer-dA-dC-dG-dT-dA-dT-dG 

primer (synthesized strand) 

dT-dG-dC-dA-dT-dA-dC 

target nucleic acid 


(template strand) 

Extension Product: 

(IPMD,) = (H) " (M t _0 

primer-dA-dC-dG-dT-dA-dT-ddG 

IPMD r=7 = (M do ) 

piimer-dA-dC-dG-dT-dA-ddT 

IPMD r _ 6 - (M ^-) 

piimer-dA-dC-dG-dT-ddA 

IPMD r _ 5 - (M-^ ) 

primer-dA-dC-dG-ddT 

IFMD £=4 = (M _dT ~) 

piimer-dA-dC-ddG 

IPMD £ _ 3 - (M" dG -) 

piimer-dA-ddC 

IPMD r _ 2 = (M dc ) 

piimer-ddA 

IPMD r=1 = (M" ddA > 

primer 



The sequence of the extension product can thus be read 
directly from the values of IPMD r for the mass spectro- 
scopic peaks in the order of increasing mass from the primer. 


Several techniques can be used in order to improve the 
process of sequence reading from the observed mass spec- 
trum. 

Firstly, the ratio of signal to noise from the observed mass 
5 spectrum can be greatly improved by the summation of 
several spectra from the original sample. This is possible 
with MALDI if the sample surface area is larger than the 
area over which the laser pulse can be focussed. 
10 Secondly, statistical techniques can be used to calibrate 
the observed mass spectrum using known molecular masses. 
These techniques will now be described. 
The Calculated and Observed Inter Peak Mass Differences 
(CIPMD and OIPMD): 
15 Let the calculated mass of a given extension product r 
residues long be CMr. 

Let the observed mass of a given extension product r 
residues long be OMr 
20 Let the calculated mass of the extension product r-1 
residues long be CMr-1 

Let the observed mass of the extension product r-1 
residues long be OMr-1 
25 Let the calculated mass of the primer (extension product 
r=0 residues long) be CMO. 

Let the observed mass of the primer (extension product 
r=0 residues long) be OM0 

The Calculated Inter Peak Mass Difference (CIPMDr) for 
30 residue r is defined as: ( 

CIPMDr)=(CMr)-(CMr-l) 

The Observed Inter Peak Mass Difference (OIPMDr) for 
residue r is defined as: 
35 (OIPMDr)=(OMr)-(OMr-l) 

Let the calculated mass increase to the extension product 
by adding an A-specific chain terminating residue at the 3' 
end of the extension product be CM-ddA 

Let the calculated mass increase to the extension product 
40 by adding an C-specific chain terminating residue at the 3' 
end of the extension product be CM-ddC 

Let the calculated mass increase to the extension product 
by adding an G-specific chain terminating residue at the 3 1 
end of the extension product be CM-ddG 
45 Let the calculated mass increase to the extension product 
by adding an T-specific chain terminating residue at the 3' 
end of the extension product be CM-ddT 

Let the calculated mass added to the extension product by 
adding a A-specific chain extending residue between the 
50 primer and the 3' end of a chain terminated extension 
product be CM-dA- 

Let the calculated mass added to the extension product by 
adding a C-specific chain extending residue between the 
primer and the 3' end of a chain terminated extension 
55 product be CM-dC- 

Let the calculated mass added to the extension product by 
adding a G-specific chain extending residue between the 
primer and the 3' end of a chain terminated extension 
product be CM-dG- 
60 Let the calculated mass added to the extension product by 
adding a T-specific chain extending residue between the 
primer and the 3' end of a chain terminated extension 
product be CM-dT- 

In the above example, where dNTP and ddNTPs residues 
65 made up from 12C, 14N, 160, 31P, 1H, which have not been 
modified in any way are used, the observed mass spectrum 
will consist of the following fragments: 
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Step 4: 


primer-dA-dC-dG-dT-dA-dT-dG 

primer (synthesized strand) 

JT- dG- dC-d A-dT- dA- dC - 



(template strand) 

Extension Product: 

OMr (OIPMDr) = (OMr) - 


(OMr - 1) 

piimer-dA-dC-dG-dT-dA-dT-ddG 

OM7 OIPMD7 

primer dA-dC-dG-dT-dA-ddT 

OM6 OIPMD6 

piimer-dA-dC-dG-dT-ddA 

OM5 OIPMD5 

primer-dA-dC-dG-ddT 

OM4 OIPMD4 

primer-dA-dC-ddG 

OM3 OIFMD3 

primer-dA-ddC 

OM2 OIPMD2 

piimer-ddA 

OM1 OIPMD1 

primer 

OMO 


For the following example, the following values are 
known with certainty: 

CM-dA-, CM-dC-, CM-dG-, CM-dT-, CM-ddA-, 
CM-ddC-, CM-ddG-, CM-ddT- and CMO (the mass of 
the primer used). 

Let us also assume that the first residue to be incorporated 
is also known from the sequence of the primer binding site. 
CM1 is therefore also known with certainty. 

The calculated values CMO and CM1 can be used in order 
to calibrate the observed mass spectrum. As bases are called 
from the calibrated OIPMD values, more masses can be 
calculated from the called sequence. More calibration points 
thus become available for the base calling algorithm. 

Working through the above example, we have: 
Step 1: 


Extension Product: 
primer-dA-dC-ddG 


OMr (OIPMDr) = (OMr) - (OMr - 1) 
OM3 OIPMD3 


OM3 appears at a given point in the mass spectrum (now 
calibrated using CMO, CM1 and CM2 as reference points) 

There are four possible values for IMPD3. These are 
CM-dA-, CM-dC-, CM-dG- and CM-dT- 

Comparison between the OIPMD3 (in the calibrated mass 
spectrum) and the calculated values CM-dA-, CM-dC-, 
CM-dG- and CM-dT- is used to assign the third base in the 
extension 

CM3 is now known with certainty. CMO, CM1 and CM2 
are also known with certainty. These four values therefore 
allow the observed mass spectrum to be more extensively 
calibrated in order to further increase its accuracy 
Step 5: 


Extension Product: 
Primer-dA-dC-dG-ddT 


OMr 
OM4 


(OIPMDr) = (OMr) - (OMr - 1) 
OIPMD4 


Extension Product: 
primer 


OMr 
OMO 


OMO appears at a given point in the mass spectrum. 

CMO is known with certainty and allows the observed 
mass spectrum to be calibrated. 
Step 2: 


OM4 appears at a given point in the mass spectrum (now 
25 calibrated using CMO, CM1, CM2 and CM3 as reference 
points) 

There are four possible values for IMPD4. These are 
CM-dA-, CM-dC-, CM-dG- and CM-dT- 

Comparison between the OIPMD4 (in the calibrated mass 
30 spectrum) and the calculated values CM-dA-, CM-dC-, 
CM-dG- and CM-dT- is used to assign the fourth base in the 
extension 

CM4 is now known with certainty. CMO, CM1, CM2 and 
CM3 are also known with certainty. These five values 
35 therefore allow the observed mass spectrum to be more 
extensively calibrated in order to further increase its accu- 
racy 
Step 6: 


Extension Product: OMr (OIPMDr) = (OMr) - (OMr - 1) 

primer-ddA OM1 OIPMD 1 


OM1 appears at a given point in the mass spectrum. 

CM1 is known with certainty and allows the observed 
mass spectrum to be calibrated. 

Two observed masses (OMO and OM1) and two calcu- 
lated masses (CMO and CM1) are thus known. 
Step 3: 


Extension Product: OMr (OIPMDr) = (OMr) - (OMr - 1) 

primer-dA-ddC OM2 OIPMD2 


OM2 appears at a given point in the mass spectrum (now 
calibrated using CMO and CM1 as reference points) 

There are four possible values for IMPD2. These are 
CM-dA-, CM-dC-, CM-dG- and CM-dT- 

Comparison between the OIPMD2 (in the calibrated mass 
spectrum) and the calculated values CM-dA-, CM-dC-, 
CM-dG- and CM-dT- is used to assign the second base in the 
extension 

CM 2 is now known with certainty. CMO and CM1 are 
also known with certainty. These three values therefore 
allow the observed mass spectrum to be more extensively 
calibrated in order to further increase its accuracy 


40 Extension Product: OMr (OIPMDr) - (OMr) - (OMr - 1) 

primer-dA-dC-dG-dT-ddA OM5 OIPMD5 


OM5 appears at a given point in the mass spectrum (now 
calibrated using CMO, CM1, CM2, CM3 and CM 4 as 
45 reference points) 

There are four possible values for IMPD5. These are 
CM-dA-, CM-dC-, CM-dG- and CM-dT- 

Comparison between the OIPMD5 (in the calibrated mass 
spectrum) and the calculated values CM-dA-, CM-dC-, 
50 CM-dG- and CM-dT- is used to assign the fifth base in the 
extension 

CM 5 is now known with certainty. CMO, CM1, CM2, 
CM3 and CM4 are also known with certainty. These six 
values therefore allow the observed mass spectrum to be 
55 more extensively calibrated in order to further increase its 
accuracy 
Step 7: 


60 Extension Product: OMr (OIPMDr) - (OMr) - (OMr - 1) 

primer-dA-dC-dG-dT-dA-ddT OM6 OIPMD6 


OM6 appears at a given point in the mass spectrum (now 
calibrated using CMO, CM1, CM2, CM3, CM4 and CM5 as 
65 reference points) 

There are four possible values for IMPD6. These are 
CM-dA-, CM-dC-, CM-dG- and CM-dT- 


Comparison between the OIPMD6 (in the calibrated mass 
spectrum) and the calculated values CM-dA-, CM-dC-, 
CM-dG- and CM-dT- is used to assign the sixth base in the 
extension 

CM6 is now known with certainty. CMO, CM1, CM2, 
CM3, CM4 and CM5 are also known with certainty. These 
seven values therefore allow the observed mass spectrum to 
be more extensively calibrated in order to further increase its 
accuracy 
Step 8: 
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Example 

The following oligodeoxyribonucleotides were synthe- 


Extension Product: 
piimer-dA-dC-dG-dT-dA-dT- 

dde 


OMi (OIFMDi) 
OM7 OIPMD7 


= (OMt) - (OMr - 1) 


OM7 appears at a given point in the mass spectrum (now 
calibrated using CMO, CM1, CM2, CM3, CM4, CM5 and 
CM6 as reference points) 

There are four possible values for IMPD7. 

These are CM-dA-, CM-dC-, CM-dG- and CM-dT- 

Comparison between the OIPMD7 (in the calibrated mass 
spectrum) and the calculated values CM-dA-, CM-dC-, 
CM-dG- and CM-dT- is used to assign the seventh base in 
the extension 

CM7 is now known with certainty. CMO, CM1, CM2, 
CM3, CM4, CM5 and CM6 are also known with certainty. 
These eight values therefore allow the observed mass spec- 
trum to be more extensively calibrated in order to further 
increase its accuracy. 


sised: 
ACGACG 
5 ACGACGT 
ACGACGTG 
AC G ACGTGT 
ACGACGTGTA 
ACGACGTGTAA 
ID ACGACGTGTAAA 
ACGACGTGTAAAA 
ACGACGTGTAAAAC 
ACG ACGTGTAAAAC G 
ACGACGTGTAAAACGA 
15 ACGACGTGTAAAACGAC 

1 pmol of an equimolar mixture of the above oligos in a 
volume of 1 ,wl was mixed with equal volumes of saturated 
aqueous 2,4,6-trihydroxyacetophenone and saturated aque- 
ous diammonium hydrogen citrate, air dried and laser 
20 desorbed/ionised using a Vested 1.2 m Benchtop Time of 
Flight Mass Spectrometer with 24 kV accelerating voltage 
and a pressure of 3.34x10" 7 Torr. 

79 laser shots were averaged and the observed masses 
were obtained after rough calibration of the instrument using 
25 protein calibration standards. The observed mass spectrum 
is shown in FIG. 1. 

The mass values for peaks in the observed mass spectrum 
were used as the inputs for the base calling program the 
results of which are shown in FIG. 2. As can be seen from 


In the above example, all available information has been . , n . , , , . 

used in order to calibrate the observed mass spectrum. The 30 the J"? base ca \ hn ^ J>oth calls the correct 

sequence of bases and gives a confidence value for the 
calling of each base. 

FIG. 3 gives a flow diagram for the base calling program 
the results of which are reported in FIG. 2. 
Base calling algorithm computer program. 

The steps below describe the algorithm functions used. 
The actual Microsoft Windows program output is shown in 
FIG. 2 and the function is shown in flow diagram form in 
FIG. 3. This program is an example only, and other ways of 
performing the functions such as calibration or confidence 
level calculation could easily be devised. 
Step 1 

Assume that the first two peaks in the mass spectrum have 
known base sequences and thus known masses. Using 


amount of data used for base calling thus increases with the 
length of sequence called. The present invention can in one 
embodiment use all available data for base calling 
(whereupon calibration of the observed mass spectrum actu- 
ally improves for base calling further from the sequencing 
primer). The present invention can also use only a selected 
number of bases prior to the base about to be called for 
calibration and base calling. For this latter process, a rolling 
local calibration can be carried out for each base called (with 
a fixed number of bases prior to the base about to be called 
being used for the calibration used for base calling). 

In the above example, CMO and CM1 were known with 
certainty and were used in order to initially calibrate the 
observed mass spectrum. Other peaks of known mass in the 


mass spectrum obvious to those skilled in the art could be 45 normal linear regression, calculate the regression coefficient 


chosen for the calibration (including calibration compounds 
of known mass not taking part of the PEMS sequencing 
reaction). 

The algorithm for calling the bases from the observed 
mass spectrum of a PEMS sequencing reaction could be 
written such that the confidence of base calling is measured 
by the closeness between the values for CM-dA-, CM-dC-, 
CM-dG-, CM-dT- and the calibrated OIPMD values. 

References 

[1] Sanger Sequencing: Molecular Cloning, A Laboratory 

Manual, 2nd edition, J. Sambrook, E. Fritsch and T. 

Maniatis, Cold Spring Harbour Laboratory Press, 1989. 
[2] Sequencing By Hybridization: Drmanac et al, Science, 

260, P 1649, (1993) 
[3] Stacked Oligos: J. Kieleczawa et al, Science 258, pl787, 

(1992). 

[4] MALDI: F. Hillenkamp et al, Anal. Chem., 63, pll93A, 
(1991). 

[5] Mass Spectrometry: D. Skoog & J. Leary, Principles of 
Instrumental Analysis, p420, Harcourt Brace Janovich, 
4th Edition (1992). 


and intercept of the regression line from the observed and 
calculated masses of the first two peaks. This calibrates the 
first part of the observed mass spectrum. 
Step 2 

50 Designate the next peak as peak n. Take the observed 
mass of peak n and use the regression coefficient and 
intercept (calibration) to calculate the mass. 
Step 3 

Determine the difference in calculated mass between 
55 peaks n and n-1. 
Step 4 

Compare the mass difference calculated in step 3 to the 
expected mass differences for the addition of an A, C, G or 
T residue to the chain. Provisionally assign the relevant base 
60 on the basis of whichever is closest. 
Step 5 

Determine a confidence level for the assignation of the 
base to the mass difference using the following procedure: 

The diagram below shows the expected mass differences 
65 for the addition of an A C, G or T residue. It also shows the 
mass at the mid point between expected mass and upper and 
lower points based on the mid point mass differences. 
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- 281.55 


As described in step 4, the base is provisionally assigned 
on the basis of whichever is closest. Thus if the mass 
difference is between 321.06 and 308.55, the base will be 
assigned as an A. 

The confidence level is assigned as the proportion of the 
difference between the relevant mid, upper or lower point 
and the A, C, G or T value. Thus a mass difference of 311.98 
will be assigned as an A with a confidence of 75.95% 
because it is 75.95% of the distance between 308.55 and 
313.06. 
Step 6 

Using a threshold value specified by the operator of the 
software, define whether the confidence level is such that the 
base is called unambiguously, called ambiguously or 
rejected. 
Step 7 

If the base has been accepted, recalculate the calibration 
using the observed mass and the mass calculated in step 2 as 
another data pair. If the base is rejected then continue to use 
the previous calibration. 

The number of data pairs used in the regression calcula- 
tion can be specified by the software operator, this can be 
designated as r. Thus only the data from peaks n-r to n (or 
0 to n) would be used. 
Step 8 

If there is another peak in the spectrum then return to step 

2. 

Chemically -created Molecular Ions 


20 


30 


ii) Synthesis of an Amino link 2 (Applied Biosystems) 
terminated primer followed by ammonia deprotection 
to give a primary amino group at the 5' end of the 
primer. Coupling of the primary amino group to the 
quaternary ammonium compound 2-aminoethyl trim- 
ethylammonium chloride using the amine-reactive 
homobifunctional coupling reagent DSS (Pierce 
Chemicals). 

iii) Synthesis of an Amino fink 2 (Applied Biosystems) 
terminated primer followed by ammonia deprotection 
to give a primary amino group at the 5' end of the 
primer. Coupling of the primary amino group to the 
quaternary ammonium compound 2-aminoethyl trim- 
ethylammonium chloride using the amine-reactive 
homobifunctional coupling reagent DMA (Pierce 
Chemicals). 

iv) Synthesis of a C6 thiol linker (Amersham 
International, RPN2112) terminated primer followed 
by ammonia deprotection to give a thiol group at the 5' 
end of the primer. Coupling of the thiol group to the 
quaternary ammonium compound 2-aminoethyl trim- 
ethylammonium chloride using the heterobifunctional 
coupling reagent Sulfo-LC-SPDP (Pierce Chemicals). 

v) Synthesis of a primer followed by ammonia deprotec- 
tion to give a hydro xyl group at the 5' end of the primer. 
Phosphorylation of the 5' hydroxyl group using T4 
polynucleotide kinase and ATP gamma thiophosphate. 
Coupling of the thiol moiety of the thiophosphate group 
to the quaternary ammonium compound 2-aminoethyl 
trimethylammonium chloride using the heterobifunc- 
tional coupling reagent Sulfo-LC-SPDP (Pierce 
Chemicals). 

In a second embodiment, each of the ddNTPs (or, more 
generally, chain terminating analogues) used for chain ter- 
mination in a sequencing reaction (either Sanger [1] or 
PEMS Sequencing) is covalently attached to a stably 
charged group. The stably charged group is 'stable' in that its 
charge is not dependent upon the local pH conditions. One 


ammonium group. The charge on the stably charged group 
may be either positive or negative. The charge may be either 
z+ or z-, where z is greater than or equal to 1. 

There are many ways of attaching a stable charge, such as 


In another aspect of this invention, molecular ions are 
created CHEMICALLY. These chemically-created molecu- 40 exam P le of sucn a ^ tabl y charged group is a quaternary 
lar ions are charged in a predetermined manner. The molecu- 
lar ions are then desorbed into a vacuum from a solid surface 
(under conditions that do not cause molecular ion 
fragmentation) for subsequent mass spectrometry. 

In one embodiment, the primer used for extension in a 4 5 a <i uatemar y ammonium group, to a ddNTP which will be 
sequencing reaction (either Sanger [1] or PEMS obvious to those skilled m the art. The following chemical 
Sequencing) is covalently attached to a stably charged group synthetic strategies will serve as just some of many possible 
at the 5' end (or internally, if this does not interfere with examples. 

hybridisation and/or polymerase extension from the primer/ i) Synthesis of a 3"-amino-l"-propynyl derivative of the 

template complex). The stably charged group is 'stable' in so ddNTP (e.g. 7-(3"-amino-l"-propynyl)-7-deaza-2 , ,3'- 
that its charge is not dependent upon the local pH conditions. 


One example of such a stably charged group is a quaternary 
ammonium group. The charge on the stably charged group 
may be either positive or negative. The charge may be either 
z+ or z-, where z is greater than or equal to 1 . 

There are many ways of attaching a stable charge, such as 
a quaternary ammonium group, to a primer which will be 
obvious to those skilled in the art. The following chemical 
synthetic strategies will serve as just some of many possible 
examples: 

i) Synthesis of an Amino link 2 (Applied Biosystems) 
terminated primer followed by ammonia deprotection 
to give a primary amino group at the 5' end of the 
primer. Coupling of the primary amino group to the 
quaternary ammonium compound betaine using the 
carbodiimide coupling reagent EDC (Pierce 
Chemicals). 


dideoxyadenosine triphosphate, 5-(3"-amino-l M 
propynyl)-2',3'-dideoxycytidine triphosphate, 7-(3"- 
amino-l"-propynyl)-7 deaza-2',3'-dideoxyguanosine 
triphosphate or 5-(3 ,, -amino-l"-propynyl)-2 , ,3'- 
dideoxyuridine triphosphate). Coupling of the 
3" -amino-l"-propynyl primary amino group to the qua- 
ternary ammonium compound betaine using the carbo- 
diimide coupling reagent EDC (Pierce Chemicals), 
ii) Synthesis of a 3"-amino-l"-propynyl derivative of the 
ddNTP (e.g. 7-(3 , '-amino-l"-p^opynyl)-7-deaza-2 , ,3 , - 
dideoxyadenosine triphosphate, 5-(3"-amino-l M - 
propynyl)-2',3'-dideoxycytidine triphosphate, 7-(3"- 
amino-l"-propynyl)-7 deaza-2',3'-dideoxyguanosine 
triphosphate or 5-(3"-amino-l"-propynyl)-2 , ,3'- 
dideoxyuridine triphosphate). Coupling of the 
3"-amino-l"-propynyl primary amino group to the qua- 
ternary ammonium compound 2-aminoethyl trimethy- 
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lammonium chloride using the amine-reactive homo- 
bifunctional coupling reagent DSS (Pierce Chemicals), 
iii) Synthesis of a 3"-amino-l"-propynyl derivative of the 
ddNTP (e.g. 7-(3 ,, -amino-l ,, -propynyl)-7-deaza-2',3'- 
dideoxyadenosine triphosphate, S^^amino-!"- 
propynyi)-2',3'-dideoxycytidine triphosphate, 7-(3 M - 
amino-l"-propynyl)-7 deaza-2',3'-dideoxyguanosine 
triphosphate or 5-(3"-amino-l"-propynyl)-2 , ,3 1 - 
dideoxyuridine triphosphate). Coupling of the primary 
amino group to the quaternary ammonium compound 
2-aminoethyl trimethylammonium chloride using the 
amine-reactive homobifunctional coupling reagent 
DMA (Pierce Chemicals). 
In a third embodiment, one of the dNTPs (or, more 
generally, chain extending analogues) used for chain exten- 15 
sion in a sequencing reaction (either Sanger [1] or PEMS 
Sequencing) is covalently attached to a stabley charged 
group. The stably charged group is 'stable' in that its charge 
is not dependent upon the local pH conditions. One example 
of such a stably charged group is a quaternary ammonium 
group. The charge on the stably charged group may be either 
positive or negative. The charge may be either z+ or z-, 
where z is greater than or equal to 1. 

There are many ways of attaching a stable charge, such as 
a quaternary ammonium group, to a dN TP which will be 
obvious to those skilled in the art. The following chemical 
synthetic strategies will serve as just some of many possible 
examples: 

i) Synthesis of a S'^amino-f'-propynyl derivative of the 


local pH conditions. One example of such a stably charged 
group is a quaternary ammonium group. The charge on the 
stably charged group may be either positive or negative. The 
charge may be either z+ or z-, where z is greater than or 
5 equal to 1. 

There are many ways of attaching a stable charge, such as 
a quaternary ammonium group, to a primer which will be 
obvious to those skilled in the art. The chemical synthetic 
strategies outlined above will serve as just some of many 
10 possible examples. 

In a fifth embodiment, one of the ddNTPs (or, more 
generally, chain terminating analogues) used for chain ter- 
mination is covalently attached to a stably charged group. 
This chain terminating analogue is then used for 3' end 
labelling of a nucleic acid molecule with terminal deoxy- 
nucleotidyl transferase. The stably charged group is 'stable' 
in that its charge is not dependent upon the local pH 
conditions. One example of such a stably charged group is 
a quaternary ammonium group. The charge on the stably 
20 charged group may be either positive or negative. The 
charge may be either z+ or z-, where z is greater than or 
equal to 1. 

There are many ways of attaching a stable charge, such as 
a quaternary ammonium group, to a ddNTP which will be 
25 obvious to those skilled in the art. The chemical synthetic 
strategies outlined above will serve as just some of many 
possible examples. 

In a sixth embodiment, one of the ddNTPs (or, more 
generally, chain terminating analogues) used for chain ter- 


dNTP (e.g. 7-(3"-amino-l"-propynyl)-7-deaza-2'- 30 mination is covalently attached to a stably charged group. 


deoxyadenosine triphosphate, 5-(3 M -amino-l"- 
propynyl^'-deoxycytidine triphosphate, 7-(3"-amino- 
l"-propynyl)-7 deaza-2-deoxyguanosine triphosphate 
or 5-(3" - amino -1" -pro pynyl) -2' -deoxy uridine 
triphosphate). Coupling of the S'^amino-f'-propynyl 35 
primary amino group to the quaternary ammonium 
compound betaine using the carbodiimide coupling 
reagent EDC (Pierce Chemicals). 

ii) Synthesis of a 3"-amino-l"-propynyl derivative of the 
dNTP (e.g. 7-(3 ,, -amino-l ,, -propynyl)-7-deaza-2'- 40 
deoxyadenosine triphosphate, 5-(3"-amino-l"- 
propynyl)-2'-deoxycytidine triphosphate, 7-(3"-amino- 
l"-propynyl)-7 deaza-2 -deoxyguanosine triphosphate 

or 5 -(3" -amino- 1" -propynyl)-2'-deoxy uridine 
triphosphate). Coupling of the 3"-amino-l"-propynyl 45 
primary amino group to the quaternary ammonium 
compound 2-aminoethyl trimethylammonium chloride 
using the amine-reactive homobifunctional coupling 
reagent DSS (Pierce Chemicals). 

iii) Synthesis of a 3 ,, -amino-l"-propynyl derivative of the 50 
dNTP (e.g. 7-(3"-amino-l"-propynyl)-7-deaza-2'- 
deoxyadenosine triphosphate, 5-(3 M -amino-l"- 
propynyl)-2'-deoxycytidine triphosphate, 7-(3"-amino- 
l"-propynyl)-7 deaza-2-deoxyguanosine triphosphate 


This chain terminating analogue is then used for recessed 3' 
end labelling ('fill-in' labelling [1]) of a partially double 
stranded nucleic acid molecule with a polymerase. The 
stably charged group is 'stable' in that its charge is not 
dependent upon the local pH conditions. One example of 
such a stably charged group is a quaternary ammonium 
group. The charge on the stably charged group may be either 
positive or negative. The charge may be either z+ or z-, 
where z is greater than or equal to 1. 

There are many ways of attaching a stable charge, such as 
a quaternary ammonium group, to a ddNTP which will be 
obvious to those skilled in the art. The chemical synthetic 
strategies outlined above will serve as just some of many 
possible examples. 

In a seventh embodiment, one of the dNTPs (or, more 
generally, chain extending analogues) used for chain exten- 
sion is covalently attached to a stably charged group. This 
chain extending analogue is then used for recessed 3' end 
labelling ('fill-in' labelling [1]) of a partially double stranded 
nucleic acid molecule with a polymerase and the other three 
dNTPs not attached to stably charged groups. The stably 
charged group is 'stable' in that its charge is not dependent 
upon the local pH conditions. One example of such a stably 
charged group is a quaternary ammonium group. The charge 


or 5-(3"-amino-l"-propynyl)-2'-deoxyuridine 55 on the stably charged group may be either positive or 


triphosphate). Coupling of the primary amino group to 
the quaternary ammonium compound 2-aminoethyl tri- 
methylammonium chloride using the amine-reactive 
homobifunctional coupling reagent DMA (Pierce 
Chemicals). 

In a fourth embodiment, The primer used for extension on 
a template is covalently attached to a stably charged group 
at the 5' end (or internally, if this does not interfere with 
hybridisation and/or polymerase extension from the primer/ 
template complex). This extension reaction may be part of a 
Polymerase Chain Reaction (PCR). The stably charged 
group is 'stable' in that its charge is not dependent upon the 


negative. The charge may be either z+ or z-, where z is 
greater than or equal to 1. 

There are many ways of attaching a stable charge, such as 
a quaternary ammonium group, to a dNTP which will be 
60 obvious to those skilled in the art. The chemical synthetic 
strategies outlined above will serve as just some of many 
possible examples. 

In an eighth embodiment, one of the dNTPs (or, more 
generally, chain extending analogues) used for chain exten- 
65 sion is covalently attached to a stably charged group. This 
chain extending analogue is then used for incorporation 
labelling during primer extension with a polymerase. The 
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stably charged group is 'stable' in that its charge is not 
dependent upon the local pH conditions. One example of 
such a stably charged group is a quaternary ammonium 
group. The charge on the stably charged group may be either 
positive or negative. The charge may be either z+ or z-, 5 
where z is greater than or equal to 1. 

There are many ways of attaching a stable charge, such as 
a quaternary ammonium group, to a dNTP which will be 
obvious to those skilled in the art. The chemical synthetic 
strategies outlined above will serve as just some of many 
possible examples. 

In an ninth embodiment, a single stranded nucleic acid 
fragment (which may be an oligonucleotide primer) is 
covalently attached to a stably charged group by any of the 
above methods. The stably charged single stranded nucleic 
acid fragment is then used in a hybridisation reaction with 15 
nucleic acid(s) (some or all of which contain sequence 
complementary to the single stranded nucleic acid 
fragment). After the hybridisation reaction, the stably 
charged single stranded nucleic acid fragment and its hybri- 
dised complement are covalently crosslinked (either by an 20 
exogeneous crosslinking reagent or by a crosslinking moiety 
on the stably charged single stranded nucleic acid fragment). 
The covalently crosslinked complex is then analysed by 
mass spectroscopy. The stably charged group is 'stable' in 
that its charge is not dependent upon the local pH conditions. 25 
One example of such a stably charged group is a quaternary 
ammonium group. The charge on the stably charged group 
may be either positive or negative. The charge may be either 
z+ or z-, where z is greater than or equal to 1. 

There are many ways of attaching a stable charge, such as 30 
a quaternary ammonium group, to a nucleic acid fragment 
which will be obvious to those skilled in the art. The 
chemical synthetic strategies outlined above will serve as 
just some of many possible examples. 

In each of the above embodiments, the charge on the 35 
phosphate backbone of the extension product can be 
removed by controlled chemical modification (such as alky- 
lation or arylation). 

In each of the above embodiments, the charge on the 
phosphate backbone of the extension product can also be 40 
removed by forming the ammonium salt of the phosphate 
backbone and then subjecting the extension product to a 
vacuum (wherein the ammonium ions protonate the 
phosphates, evaporate as ammonia and produce the acid 
form of the phosphate backbone). 45 

In each of the above embodiments, the extension products 
thus consist of a stably charged group covalently attached to 
the molecule of interest (charge may be either z+ or z-) and 
the chemically modified or acid form of the phosphate 
backbone (both of which are uncharged). All the extension 50 
products are thus molecular ions with the same charge on 
each molecular ion (either z+ or z— ). m/z for each species is 
thus directly related to mass. For the third and eighth 
embodiments above, conditions will need to be empirically 
determined such that the same number of charged dNTPs 55 
(preferably one) is incorporated in each extension product. 

These molecular ions may be applied to a solid surface, 
conveniently in solution in a volatile solvent which is 
afterwards removed by evaporation. They may be applied 
alone or in admixture with a matrix material. 60 

Desorption of these molecular ions into a vacuum from 
the solid surface is a prerequisite for mass spectroscopic 
analysis of the molecular ions. 

Two methods are envisaged for the desorption of these 
molecular ions: 65 

Localised heating brought about by laser, or other, irra- 
diation. 


Charge repulsion between the stably charged group and 
the solid surface to which it is attached. 

A combination of these two approaches can also be 
envisaged. 

In the latter method, the solid surface could be one of the 
electrodes for molecular ion acceleration in a TOF mass 
spectrometer. This would ensure that all of the molecular 
ions started out from known positions (i.e. the electrode 
surface, which could be planar and in the same plane a the 
TOF detector) and with identical initial kinetic energies. 
These two factors should combine to greatly improve the 
resolution of the mass spectrometer. 

We claim: 

1. A method of sequencing a target nucleic acid by using 
primer extension mass spectroscopy to generate an observed 
mass spectrum wherein the observed mass spectrum is 
generated by the steps of: 

a) hybridising at least one oligonucleotide primer to the 
target nucleic acid so as to form at least one primer- 
target hybrid, 

b) Subjecting each primer-target hybrid to polymerase 
catalysed chain extension conditions in the presence of 
four chain-extending nucleotides and of four chain- 
terminating nucleotide analogues, so as to generate a 
nested set of primer extension products, 

c) Subjecting each nested set of primer extension products 
to minimal fragmentation mass spectroscopy so as to 
generate said observed mass spectrum, 

wherein the following values are known with certainty: 
CM^", CMT dc -, CM- dG -, CM'**-, CM"^", CMr ddc ; 
CM~ ddG ~, CM- ddT % and CMO (the mass of the primer 
used), 

wherein the calculated value CMO is used in order to 
calibrate inter-peak mass difference values (OIPMD) 
of the observed mass spectrum, 

base calling cycles are carried out using the calibrated 
OIPMD values such that: 

each called base allows mass calculation for the peak 
in the observed mass spectrum corresponding to 
said called base, and this calculated mass is then 
used as a further calibration point for subsequent 
rounds of base calling. 

2. The method as claimed in claim 1, wherein the fol- 
lowing values are known with certainty: 

CM"' M \ CM"^-, CM~ dG ~, CM"^-, CM- ddA -, CM~ ddc ~, 
CM"^-, CM ddT -, CMO (the mass of the primer used), 
and CM1 (the mass of the first peak beyond the primer 
used), and 

the calculated values CMO and CM1 are used in order to 
calibrate OIPMD values of the observed mass spec- 
trum. 

3. The method as claimed in claim 1 wherein the follow- 
ing values are known with certainty: 

CM dA \ CM ^ CM" rfG ", CM^", CM"^, CM~ ddc ~, 
CM ddCf -, CM ddT , CMO (the mass of the primer used), 
and the masses (CMl-r) of the first through rth peak 
beyond the primer used where r is the number of 
residues of extension product from the primer, and 

the calculated value CMO, and CM1 through CMr are 
used in order to calibrate OIPMD values of the 
observed mass spectrum. 

4. The method as claimed in claim 1 wherein the follow- 
ing values are known with certainty: 

CM dA , CM dc , CM dG , CM ^ , CM ^", CM ^ C ", 
CM"^-, CM"^- and CMO (the mass of the primer 
used), and the mass of one or more calibration com- 
pounds (of known mass), and 


5,849,542 


17 


18 


the calculated values CMO and the calibration compound 
masses) are used in order to calibrate the OIPMD 
values of the observed mass spectrum. 

5. The method as claimed in claim 1 wherein all of the 
bases called are used for the calibration step. 

6. The method as claimed in claim 1 wherein a rolling 
subset of all of the bases called are used for the calibration 
step. 

7. The method as claimed in claim 1 wherein linear 
regression is used for the calibration step. 

8. The method as claimed in claim 1 wherein nonlinear 
regression is used for the calibration step. 

9. The method as claimed in claim 1 comprising the step 
of determining the confidence of base calling by measuring 
the closeness between the values for CM"^", CM~ dC ~, 
CM' dG -, CM dT - and the calibrated OIPMD values. 

10. The method as claimed in claim 1 wherein a plurality 
of oligonucleotide primers is hybridized in step a) to the 
target nucleic acid so as to form a plurality of primer-target 
hybrids. 

11. The method as claimed in claim 1 wherein an oligo- 
nucleotide primer comprises one or more residues selected 
from the group consisting of residues modified in the base, 
residues modified in the sugar, residues modified in the 
backbone and residues which have one or more attached 25 
intercalating groups. 


12. The method as claimed in claim 1 wherein each 
oligonucleotide primer used in step a) is attached to a solid 
surface. 

13. The method as claimed in claim 1 wherein each nested 
5 set of primer extension products in step c) is embedded in a 

matrix, from which it is desorbed and ionised by minimal 
fragmentation means of a laser pulse for analysis by mass 
spectroscopy. 

14. The method as claimed in claim 13 wherein the 
ID minimal fragmentation mass spectroscopy is time-of-flight 

mass spectroscopy or fourier transform mass spectroscopy. 

15. The method as claimed in claim 1 wherein the primer 
extension products, generated in step b) and subjected to 
minimal fragmentation mass spectrometry in step c) have 

15 been chemically charged in a predetermined manner. 

16. A reaction mixture containing all four base specific 
chain extension nucleotides which are not labelled so as to 
be distinguishable from one another, and four chain termi- 
nation nucleotide analogues which are not labelled so as to 

20 be distinguishable from one another. 

17. The reaction mixture of claim 16, wherein the base 
specific chain extension nucleotides are deoxynucleoside 
triphosphates and the chain terminating agents are dideoxy- 
nucleoside triphosphates. 
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